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ABSTRACT 

Mutant transcripts containing expanded CUG 
repeats in the untranslated region are a patliogenic 
factor in myotonic dystropliy type 1 (DIVI1). Tlie 
mutant RNA sequesters the muscleblind-like 1 
(IVIBNLI) splicing factor and causes misregulation 
of the alternative splicing of multiple genes that 
are linked to clinical symptoms of the disease. In 
this study, we show that either long untranslated 
CAG repeat RNA or short synthetic CAG repeats 
induce splicing aberrations typical of DM1. 
Alternative splicing defects are also caused by 
translated CAG repeats in normal cells transfected 
with a mutant ATXN3 gene construct and in cells 
derived from spinocerebellar ataxia type 3 and 
Huntington's disease patients. Splicing 
misregulation is unlikely to be caused by traces of 
antisense transcripts with CUG repeats, and the 
possible trigger of this misregulation may be se- 
questration of the MBNL1 protein with nuclear 
RNA inclusions containing expanded CAG repeat 
transcripts. We propose that alternative splicing 
misregulation by mutant CAG repeats may contrib- 
ute to the pathological features of polyglutamine 
disorders. 

INTRODUCTION 

Human triplet repeat expansion diseases (TREDs) are 
caused by abnormally elongated trinucleotide repeats 
located in either coding or non-coding regions of specific 
genes. Although they have a similar type of underlying 
mutation, TREDs represent a chnically heterogeneous 
group of disorders with some shared symptoms (1,2). 
This observation implies that the phenotype for each 
disease is determined not only by the different genes har- 
boring the mutations and their expression patterns 
but also by the mutated sequences themselves or their 



structural features (3-6). A subgroup of TREDs, 
polyglutamine (poly-Q) disorders, is caused by expansions 
of CAG repeats in translated sequences encoding poly-Q 
tracts in multiple functionally unrelated proteins (7). 
Huntington's disease (HD) and several types of 
spinocerebellar ataxia (SCA) belong to this group. 

Diseases caused by expanded repeats located in untrans- 
lated regions (UTR) are another subgroup of TREDs. In 
myotonic dystrophy type 1 (DM1) and type 2 (DM2), a 
crucial role in pathogenesis is played by mutant DMPK 
and ZNF9 transcripts containing expanded CUG or 
CCUG repeats located in the 3'-UTR or in an intron, 
respectively (8,9). Both types of repeats form stable 
hairpin structures (10-12) that are toxic to cells because 
they trigger alternative splicing aberrations in multiple 
genes (13,14), including the insuhn receptor (INSR), 
chloride channel 1 (CLCNl), sarcoplasmic/endoplasmic 
reticulum Ca+^ ATPase 1 and 2 (SERCAl and 
SERCA2), cardiac troponin T (cTNT) and Tau 
{MART). Aberrant expression of these genes is linked to 
clinical symptoms of DM1 and DM2, such as insulin re- 
sistance, myotonia, muscle wasting, cardiac abnormahties 
and cognitive deficits (14-17). The splicing aberrations 
mainly result from an imbalance in the functional levels 
of two antagonistic alternative splicing regulators, 
muscleblind-like 1 (MBNLl) and CUGBPl (18,19). 
MBNLl is sequestered by mutant transcripts and accu- 
mulates in the nucleus in the foim of inclusions in 
both DM1 and DM2 (20). CUGBPl undergoes 
hyperphosphorylation, and its cellular level is elevated 
only in DM1 tissues (20-26). 

The pathogenesis of poly-Q diseases is commonly 
attributed to expanded poly-Q tracts in proteins (27). 
Numerous cellular processes including transcriptional 
regulation, mitochondrial functions and axonal transport 
have been shown to be affected by proteins containing this 
type of mutation (27). Poly-Q expansions contribute to 
neurodegenerative diseases by both gain-of-function and 
partial loss-of-function mechanisms (28). The protein 
sequence context of the poly-Q tract was also shown to 
influence disease specificity (29). Mutant proteins or 
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protein fragments containing poly-Q tracts form both 
nuclear and cytoplasmic inclusions (30,31), however, the 
toxic role of these deposits has been questioned in a 
number of studies (32-34). A contribution of the mutant 
transcript to the pathogenesis of poly-Q diseases has been 
postulated by several authors (35-37) due in part to the 
similar molecular architecture of CAG and CUG repeat 
RNAs (11,38^0) and the ability of MBNLl protein to 
bind to the repeats (41). In addition, in the Drosophila 
eye model of spinocerebellar ataxia type 3 (SCA3), long 
CAG repeats were shown to generate some of the degen- 
erative effects induced by CUG repeats (35). 

In this study, we addressed the question of whether 
transcripts with expanded CAG repeats may cause alter- 
native splicing aberrations similar to those triggered by 
CUG repeats. We performed comparative sphcing 
analysis of a set of genes with misregulated splicing in 
DM1 cells, and we found that expression of long CUG 
and CAG repeats in normal cells triggers similar splicing 
changes. This effect was found for untranslated or 
translated repeats of exogenous and endogenous mutant 
transcripts. In cells expressing transcripts with expanded 
CAG repeats, we detected intranuclear RNA foci 
composed of the sense transcripts that co-localize with 
the MBNLl protein. We conclude that, given the 
similarities between the mutant CUG and CAG repeat 
transcripts in triggering aberrant alternative splicing, the 
RNA-containing expanded CAG repeats may contribute 
to the pathogenesis of HD and SCA3 as well as other 
poly-Q diseases. 



MATERIALS AND METHODS 

Plasmids and synthetic oligonucleotides 

DMPK fragments containing (CTG)74 and (CTG)2oo were 
PCR amplified from DM1 patient genomic DNA. Shorter 
CTG and CAG fragments were generated by chemical 
synthesis and in vitro expansion events (42). All inserts 
were cloned as NotI fragments into the pEGFP-N3 
vector (Clontech) in the 3'-UTR region of the enhanced 
green fluorescent protein (EGFP) cDNA. The 
EGFP-mutATXN3 construct (which drives expression of 
the EGFP-mutant ataxin 3 fusion protein) was also 
prepared in the pEGFP-N3 vector. The stop codon of 
EGFP was removed, and the human ATXN3 sequence 
containing 69 CAG repeats was inserted in frame as 
a Bsrgl/NotI fragment. The insert sequences and 
number of repeats were confirmed by sequencing. 
All RNA oligonucleotides used in this study were 
synthesized by Metabion (Martinsried, Germany). The 
following synthetic RNAs were used: (CUG)?, (CAG)?, 
(CUG)2o, (CAG)2o, (AUG)i7, (UUA)i7, (UGG)i7 and 
pre-miR-136 (43). The short interfering RNA (siRNA) 
duplexes used in this study were the same as described 
previously: ATX N 3 -^.pecific siG16 (44), siMBNLl (45) 
and siLuc (sense 5'-CGU ACG CGG AAU ACU UCG 
AUU-3'; antisense 5'-UCG AAG UAU UCC GCG UAC 
GUU-3'). 



Cell culture and transfection 

HeLa cells were maintained in RPMI 1640 medium 
(Bioshop) supplemented with 10% fetal bovine serum 
(HyClone) and antibiotics (Sigma). The human neuro- 
blastoma SK-N-MC cell line (DSMZ-Deutsche 
Sammlung von Mikroorganismen und Zellkulturen, 
Germany) was maintained in DMEM (Sigma) supple- 
mented with 10% fetal bovine serum, antibiotics and add- 
itional amino acids (Sigma). Fibroblasts from normal 
individuals (GM07492, GM00024, GM0321 and 
GM01650) as well as DM1 (GM03987), SCA3 
(GM06153, GM06151) and HD patients (GM03864, 
GM04208 and GM04281) were obtained from the 
Coriell Cell Repositories and were grown in minimal es- 
sential medium (MEM, Gibco BRL Life Technologies) 
supplemented with 8% fetal bovine serum, antibiotics 
and MEM-vitamins. Cell transfections were performed 
using Lipofectamine 2000 reagent (Invitrogen) according 
to the manufacturer's instructions. 

Generating stable cell lines 

To estabhsh stable clones, HeLa and SK-N-MC cells were 
transfected using Lipofectamine 2000 with Dralll- 
hnearized pEGFP-N3-derived plasmids carrying the 
neomycin resistance marker. Having completed 48 h 
after transfection, cells were transferred to medium con- 
taining 0.6mg/ml G418 (Sigma) to select for stable inte- 
gration of the plasmid. G418-resistant colonies were 
isolated from each transfection. Colonies were expanded 
and screened for exogene expression by £'GF/'-specific 
real-time reverse transcription polymerase chain reaction 
(RT-PCR). Stable clones were maintained in 
G418-containing medium. 

MBNLl over-expression 

HeLa cell lines with the long CAG repeats that were 
chosen for pEGFP-MBNLl transfection (74CAG K3, 
200CAG K3 and 200CAG K4) were seeded on to a 
6-well plate and 24 h after the cells were transfected using 
Lipofectamine 2000 with 0.5 (ig pEGFP-ClMBNLl 
plasmid that bears the sequence of MBNLl protein 
41 kDa isoform. Completing 48 h after transfection, cells 
were harvested and RNA was isolated. The cDNA 
synthesized using this template was further analyzed. 

Fluorescence in situ hybridization and immunofluorescence 

The detection of CUG and CAG transcripts in DM1, HD 
and SCA3 fibroblasts as well as in transfected HeLa cells 
was carried out by Fluorescence in situ hybridization 
(FISH) using Cy3-labeled DNA/LNA probes (CTG)6- 
CT or (CAG)6-CA, which were modified at positions 2, 
5, 8, 13, 16 and 19 with LNA. For FISH and immuno- 
fluorescence (IF), the cells were fixed in 4% 
paraformaldehyde and were permeabilized with 2% 
pre-chilled acetone in phosphate buffered saline (PBS) at 
pH 7.4 for 5min. For ribonuclease (RNase) treatment, 
cells were permeablized and then incubated with 0.01 ^g/ 
ml RNase A (Sigma) for 1.5 h at 37°C. Pre-hybridization 
was performed in 30% formamide and 2x SSC for lOmin, 
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followed by hybridization in buffer containing 30% 
formamide, 2x SSC, 0.02% BSA, 66|.ig/nil yeast tRNA, 
10% dextran sulfate, 2mM vanadyl ribonucleoside 
complex and 2ng/^l DNA/LNA probe. 
Post-hybridization washing was done in 30% formamide 
and 2x SSC at 45°C for 30min followed by Ix SSC at 
37°C for the next 30min. Shdes were either mounted in 
Vectasliield medium (Vector Laboratories, Inc. 
Burlinganie, CA, USA) containing 4',6-dianiidino-2- 
phenylindole (DAPI) or were submitted to IF. For RNA 
FISH in combination with IF, the shdes were treated as 
described above with no DAPI staining. The slides were 
incubated in blocking buffer (3% non-fat milk, 5% 
normal serum and 0.3% Triton X-100 in PBS) for 1 h at 
room temperature, and then incubated with rabbit 
anti-MBNLl primary antibody (A2764) (a gift from Dr 
Charles Thornton) overnight at 4°C. After the reaction, 
the slides were washed in PBS and then incubated with a 
secondary anti-rabbit antibody conjugated to Cy2 
(Jackson Immuno Research Laboratories, Inc.). 
Following PBS washes, the nuclei were stained with 
DAPI in Vectashield mounting medium. All FISH/IF 
images were acquired using a Zeiss Axiovert 200 M micro- 
scope equipped with a cooled AxioCam HRc camera and 
processed with LSM 510 software. 

For DNase treatment cells were fixed and incubated O/ 
N with 70% EtOH. The next day, cells were pre-incubated 
for lOmin in cytoskeletal buffer with sucrose (CSK) 
(10 mM HEPES, 300 niM sucrose, lOOniM NaCl, 3mM 
MgCl2, 0.5% Triton X-100, lOmM vanadyl riboncleoside, 
pH 6.8) [according to Ho et al. (36)] followed by incuba- 
tion for 1.5 h at 37°C with 0.8 U/^il RNase-free DNase I 
(Epicentre) in the CSK buffer. After the reaction, DNA 
was removed by two additional washes with the ice-cold 
cytoskeletal buffer. DNase control slides were incubated 
under the same reaction conditions but DNase was 
omitted in the reaction buffer. 

RT-PCR and real-time PCR 

RT-PCR analysis was performed using standard proto- 
cols. Briefly, 1 ^g of total RNA was reverse transcribed 
using Superscript II reverse transcriptase (Invitrogen) ac- 
cording to the manufacturer's protocols with use of 
random hexamers. All reverse transcription reactions, 
after 10-fold dilution, were subjected to an optimal 
number of PCR cycles that were within the hnear range 
of DNA amplification. PCR was conducted to analyze the 
splice products of several genes specified in Supplementary 
Table SI. RT-PCR products were resolved in 3% agarose 
gel containing 0.5 ng/ml ethidium bromide. The fraction 
of exon inclusion was calculated by dividing the intensity 
of the PCR band corresponding to the exon inclusion 
splice product by the total intensity of the RT-PCR 
products. 

Real-time PCR was performed using a Rotor-Gene 
3000 (Corbett Research) in the presence of SYBR green. 
Optimization of the real-time PCR was performed accord- 
ing to the manufacturer's instructions (Applied- 
Biosystems, SYBR Green PCR Master Mix reagent 
protocol). Appropriate primers were used for the 



detection and quantification of EGFP expression 
(forward 5'-CCGAGATAGGGTTGAGTGTTG; reverse 
5'-CTTTCCCCGTCAAGCTCTAA) and GAPDH ex- 
pression (forward 5'-GAAGGTGAAGGTCGGAGTC; 
reverse 5'-GAAGATGGTGATGGGATTTC). 

Multiplex ligation-dependent probes amplification 

The Multiplex ligation-dependent probes amplification 
(MLPA) assay was designed to assess the relative dosage 
of sense and antisense transcripts of triplet repeat- 
containing EGFP genes. The assay mixture included two 
antisense-specific probes and two sense-specific probes. 
The probe design strategy was as described previously 
(46). Target sequences of all designed probes were 
located within the EGFP sequence in close proximity to 
each other and to the triplet repeat tracts (Supplementary 
data). Probes (sequences are shown in Supplementary 
Table S3 and Supplementary Figure S3) were generated 
by chemical synthesis (Sigma). The MLPA reaction was 
performed as described previously (47), following the 
general directions provided by MRC-Holland (www. 
mlpa.com). Briefly, 5 [i\ of cDNA (prepared as described 
above and diluted 2x) or 5^1 of pEGFP-N3 (7.5ng/|il) 
was incubated at 98°C for 5min. After coohng the 
samples to room temperature, 1.5 |il of probe mix (con- 
taining 1 fmol of each probe) and 1.5 |.il of SALSA hybrid- 
ization buffer were added, and the solution was denatured 
at 95°C for 2min and hybridized at 60°C for 16 h. 
Hybridized probes were ligated at 54° C for 15min by 
the addition of 32 ^il ligation mixture. Following heat in- 
activation, 10|il of hgation reaction was mixed with 30^1 
of PCR buffer, heated to 60°C, mixed with 10 ^1 PCR 
mixture (SALSA polymerase, dNTPs and universal 
primers, one of which was labeled with fluorescein) and 
subjected to 30 cycles of PCR amplification. All reagents 
except the synthesized oligonucleotide probes were 
obtained from MRC-HoUand. Amplification products 
were diluted 1:40 in HiDi formamide (ABI) containing 
1/36 volume of LIZ600 size standard (ABI) and then 
separated by size on an ABI 3100 Genetic Analyzer 
(ABI). Electropherograms were analyzed by GeneScan 
(ABI), and peak height data were exported to an Excel 
table. Excel programs were generated to transform the 
peak height data to normalized values. Briefly, peak 
heights (signals) for each probe were divided by the 
average signal from two sense-specific probes (run-to-run 
signal normalization), and these values were divided by 
corresponding values calculated from the reference 
vector plasmid DNA sample (relative dosage estimation). 
To facihtate comparisons between samples, calculated 
relative values were adjusted to obtain an average signal 
from sense-specific probes that was equal to 1. 

Western blotting 

Cells were lysed with Ix PB homogenate buffer (60 mM 
Tris-HCl, 2% SDS, 10% sucrose, 2mM PMSF). A 
quantity of 20|ig of protein lysate were separated by 
SDS-PAGE and then transferred onto nitrocellulose 
membranes (Sigma). After blocking with 5% skim milk, 
membranes were incubated with primary antibodies. 
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The following primary antibodies were used: anti-MBNL 
(EnCor; diluted 1:5000), anti-CUG binding protein 1 
(CUG-BPl) (Abeam; diluted 1:500), anti-ataxin3 
(Millipore, diluted 1:1000) and anti-GAPDH (Chemicon; 
diluted 1:20000). After incubation with the primary 
antibody, membranes were washed in PBS containing 
0.3% Tween-20 and then incubated with a secondary 
antibody conjugated to biotin. After several washes, 
proteins were visualized using the Sigma Fast BCIP/ 
NBT kit (Sigma). 

Filter binding assay 

The filter binding assay was performed as described pre- 
viously (41). Briefly, various concentrations of MBNLl 
protein ranging from 0.1 nM to 200 nM were incubated 
with 500 fM of 5'-end-labeled CUG and CAG RNA frag- 
ments consisting of 7 or 20 triplet repeats in a reaction 
buffer (100 mM Tris-HCl pH 8.0, 50 mM NaCl, 50 mM 
KCl, 1 mM MgClj). As controls, 5'-end-labeled (AUG)i7, 
(UUA)i7 and (UGG)i7 RNA fragments were used. After 
BOmin of incubation at 37°C, samples were loaded onto 
filters (nitrocellulose and nylon membranes) and 
sandwiches were assembled on the slot-blot apparatus. 
Signal from both filters was visualized by phosphoimaging 
(Typhoon; Molecular Dynamics). The dissociation 
constant values {Kd) were calculated based on three inde- 
pendent experiments using the one-site binding model 
(GraphPad Prism 5.00). 

Statistics 

Statistical significance was assessed by a two-tailed r-test 
using Prism V. 4.0 GraphPad Software (San Diego, CA, 
USA). 

RESULTS 

Cellular models expressing different lengths of 
untranslated CAG and CUG repeats 

We sought to compare CUG and CAG repeats of differ- 
ent lengths in their abihty to trigger aberrant alternative 
splicing characteristic for DM 1 . For this purpose, we used 
HeLa and SK-N-MC cell hues, which express the MBNLl 
and CUGBPl splicing factors at similar levels 
(Supplementary Figure SI). We generated stable clone 
lines expressing CAG and CUG repeats located in the 
3'-UTR of pEGFP-N3 (Figure lA). The recombinant 
exogene encoded 5, 30, 74 and 200 repeats of the CAG 
or CUG motif, and these lengths represent tracts that 
occur in normal (5, 30) and mutant (74, 200) alleles of 
genes implicated in DM1 and poly-Q diseases (28). 
Clones that maintained stable expression of the exogene 
were used for subsequent analyses. The level of exogene 
expression determined by quantitative RT-PCR was 
similar in all selected clonal lines. 

To resolve whether expression of CAG and CUG 
repeats is associated with nuclear retention of the repeat 
RNA, we performed RNA in situ hybridization. We found 
that only mutant variants of either the CUG or the CAG 
repeats (74 and 200 triplets) gave rise to numerous nuclear 



foci in HeLa cells (Figure IB). The abundance of the 
intranuclear foci was greater in cells with the CUG 
repeats than in those with the CAG triplets. No foci 
were detected in cells expressing 5 or 30 CUG or CAG 
repeats. 

Transcripts containing long untranslated CUG and CAG 
repeats trigger similar splicing aberrations in model cells 

For alternative splicing analyses, we chose a set of en- 
dogenous transcripts instead of the exogenous minigenes 
typically analyzed for alternative splicing aberrations in 
DM1 (36). The pool of genes and splicing events that 
are known to undergo aberrant splicing in DM1 was 
limited to those that are expressed in HeLa and 
SK-N-MC cells and have a ratio of splice variants 
suitable for measurement. For MBNLl-sensitive genes, 
these criteria were fulfilled by the INSR (IK), CLCNl, 
SERCAl (ATP2A1) and LDB3 (ZASP) genes in HeLa 
cells and by the CLCNl, SERCAl and MART genes in 
SK-N-MC cells. For analysis of MBNLl-independent 
splicing events, we selected the CUGBP-regulated 
CAPZB gene, which shows an abnormal splicing pattern 
in DM1 patients as well as the ATEl and EHLl genes that 
are known to be alternatively spliced but do not exhibit 
any splicing abnormahties in DM (48). 

The alternatively spliced INSR mRNAs give rise to re- 
ceptors differing in their signaling properties. In normal 
adult skeletal muscle, the INSR mRNA containing exon 
11, encodes an insulin receptor with greater signahng 
activity than that of the INSR mRNA lacking this exon. 
In DM1 skeletal muscle, the fetal isoform of INSR 
mRNA without exon 11 is retained and results in 
compromised signaling (15). We observed a higher 
degree of exon 11 exclusion (—ex.11) in HeLa cells ex- 
pressing 74 or 200 CUG repeats (mean 67 ± 9%) than 
in the cells with 5 or 30 CUG repeats (mean 28 ± 5%). 
A very similar effect was observed for CAG repeats of the 
same length (Figure 2). The CLCNl mRNA expressed in 
skeletal muscle of normal adults excludes exon 7a while, in 
DM1 tissue, the fetal isoform containing exon 7a is 
retained (CLCN 1 +ex 7a), which results in transcript deg- 
radation by nonsense mediated decay (NMD) (49). We 
found that the CLCNl isoform without exon 7a was the 
predominant isoform in non-transfected HeLa cells and in 
cells expressing either 5 CUG or 5 or 30 CAG repeats 
while the isoforms containing exon 7a were more preva- 
lent in HeLa cells expressing 30, 74 or 200 CUG repeats or 
74 or 200 CAG repeats (Figure 2). The same pattern of 
alternative splicing isoform expression was observed in 
SK-N-MC cells (Supplementary Figure S2). In the case 
of the SERCAl mRNA, exon 22 is excluded in only a 
small fraction of the mRNA in normal adult skeletal 
muscle tissue, but it is excluded from most of the 
mRNA in tissue of DM1 patients (16,50). In our model 
system, the control HeLa cells lacking the exogenous 
repeats and those expressing 5 or 30 CUG or CAG 
repeats showed a higher proportion (mean 72 ± 5%) of 
longer SERCAl mRNA containing exon 22. This propor- 
tion decreased significantly with increasing CUG or CAG 
repeat length (Figure 2). In SK-N-MC ceUs expressing 
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Figure 1. Expression of untranslated CUG and CAG repeats and cellular localization of expanded repeat transcripts. (A) Diagram of the 
pEGFP-N3-derived constructs used to generate stable HeLa and SK-N-MC cell lines expressing 5, 30, 74 and 200 of the CUG or CAG repeats 
in the 3'-UTR of mRNA encoding EGFP. (B) Representative RNA-FISH images of HeLa cells cultured in the presence of plasmid encoding 
(CUG)2oo and (CAG)2oo repeat RNAs. Nuclear retention of the mutant transcripts is visible. Nuclei were stained with DAPI (blue); RNA foci are 
red; magnification x60. 



CUG or CAG repeats, the same trend in splicing aberra- 
tions was observed (Supplementary Figure S2). The alter- 
native splicing pattern was also analyzed for the LDB3 
mRNA in HeLa cells and for MAPT mRNA in 
SK-N-MC cells. Consistently, similar changes in the dis- 
tribution of alternative isoforms were observed in ceUs 
expressing CUG and CAG repeats (Figure 2 and 
Supplementary Figure S2). Most of the analyzed clonal 
hnes expressing long CAG and GUG repeats have 
shown splicing misregulation. But not all of them, i.e. 3 
out of 8 CAG HeLa lines and 3 out of 10 CUG HeLa lines 
did not show mis-regulated splicing. Among the genes 
selected as the alternative splicing controls, no substantial 
differences were observed for the CAPZB and ATEl 
mRNAs in the presence of CUG or CAG repeats ex- 
pressed in both cellular models. 

Partial reversion of splicing defects in cell Hnes express- 
ing long tracts of CAG repeats after MBNLl 
overexpression may suggest that observed changes are 
associated with MBNLl sequestration. We transfected 
three Hela cell lines expressing long CAG tracts (74CAG 
K3, 200CAG K3 and 200CAG K4) with 
pEGFP-ClMBNLl plasmid which is an expression 
vector bearing a sequence of 41kDa isoform MBNLl 
protein. Completing 48 h after transfection, the RNA 
was isolated. The analysis of SERCAl and INSR 
endogenes showed that approximately 5-fold 



overexpression of MBNLl produced a 24% increase in 
SERCAl exon 22 inclusion and a 16% increase in INSR 
exon 11 inclusion in cells expressing 200 CAG repeats 
(Figure 3). 

Synthetic short CUG and CAG repeat RNAs trigger 
splicing aberrations 

In the experiments presented above, the shorter CUG and 
CAG repeat tracts were unable to induce the splicing ab- 
errations that were triggered by the longer repeats. To 
determine whether a higher concentration of shorter 
repeats would induce aberrant splicing in cells, we trans- 
fected HeLa cells with 5, 10 and 40 nM of synthetic 
ohgoribonucleotides (ORNs) composed of 7 or 20 CAG 
or CUG repeats. In addition, we used (AUG)i7, (UUA)i7 
and (UGG)i7 as well as control non-repeat ORNs [i.e. an 
siRNA duplex against the luciferase transcript (siLuc) and 
the precursor of microRNA-136]. Treatment with the 
highest 40 nM concentration of (CUG)7 resulted in con- 
siderable exclusion of exons from INSR and SERCAl 
mRNA (the proportion of shorter isoforms increased 
from 27 ± 3% to 45 ± 6% and from 31 ± 2% to 
52 ± 5%, respectively) and inclusion of exon 7 in 
MBNLl mRNA (49 ± 2% to 65 ± 3%) (Figure 4). A 
similar effect was observed when 40 nM of (CAG)7 was 
used. Dehvery of longer ORNs [i.e. (CUG)2o and 
(CAG)2o] at the same concentration to HeLa cells, 
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Figure 2. CUG and CAG repeats trigger similar pattern of the aberrant alternative splicing in HeLa cells. Total RNA isolated from cells of clonal 
HeLa lines expressing untranslated CUG or CAG repeat RNA of designated lengths was used to assess the levels of splicing variant transcripts from 
the INSR. SERCAl. CLCNl. LDB3, CAPZB, and ATE] genes by RT-PCR. DM1 -specific splicing aberrations were observed for the longest (74 
and 200) both the CUG or CAG repeats and only for genes that are known to be regulated by MBNLl. The CAPZB. that is known to be regulated 
by CUGBPl and ATEl control gene that is not disrupted in DM1 patients, did not show significant changes in the proportion of transcript variants. 
The experiments were carried out in triplicate and quantitative results are shown as bar diagrams. The fraction of exon inclusion or exclusion ( ± SD) 
was calculated by dividing the signal of RT-PCR band corresponding to the inclusion/exclusion splice product by the total RT-PCR representing all 
splice products. *P = (0.05; 0.001), **/>< 0.001 compared with the control. 



triggered similar effects on splicing aberrations. When 
four control ORNs were introduced into the cells (UUA 
and UGG triplet repeats, siLuc and pre-miR-136), splicing 
niisregulation of the tested transcripts was not observed; 
however, (AUG)i7 triggered changes in the exon inclusion 
of MBNLl and SERCAl similar to those observed after 
delivery of (CUG)2o and (CAG)2o ORNs. Alternative 
splicing of the control ATEl and FHLl genes was not 
affected by any of the utilized ORNs (data not shown). 
Furthermore, neither (CUG)? nor (CAG)? was capable of 
triggering measurable alternative splicing aberrations at 
concentrations of 5 and 10 nM. 



Poly-Q disease transcripts misregulate splicing in 
transfected cells and patient-derived cells 

Next, we asked whether cells expressing translated CAG 
repeats would recapitulate splicing changes triggered by 
untranslated CAG repeats that were found in HeLa and 
SK-N-MC clonal hues. Therefore, we estabHshed stable 
SK-N-MC clones expressing an EGFP-mutant Ataxin 3 
fusion protein (mutATXN3) (Supplementary Figure S4). 
SK-N-MC clones expressing an EGFP wild-type Ataxin 3 
fusion protein (wtATXN3) (Supplementary Figure S5) 
were established as an additional control. In three out of 
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Figure 3. Overexpression of MBNLl partially reverses splicing abnormalities caused by long CAG repeats in HeLa cells. Control — untransfected 
cells; +MBNL41 lanes — cell lines bearing long CAG tracts transfected with pEGFP-Cl MBNLl plasmid. The level of transcript expressed from 
pEGFP-ClMBNLl was determined by RT-PCR analysis. The experiments were carried out in duplicate and quantitative results are shown as bar 
diagrams. MBNLl overexpression was estimated in comparison to GAPDH expression level. The fraction of exon inclusion or exclusion {±SD) was 
calculated by dividing the signal of RT-PCR band corresponding to the inclusion/exclusion splice product by the total RT-PCR signal representing 
all splice products. *P = (0.05; 0.001), compared with the cells that were not transfected with pEGFP-ClMBNLl plasmid. 



six mutATXN3 clones, we found similar splicing alter- 
ations when compared with those induced by long un- 
translated CAG repeats (Figure 5 and Supplementary 
Figure S2). As shown in Figure 5, out of the three genes 
that were analyzed, two {CLCNl and SERCAl ) showed 
splicing misregulation. No changes were detected for 
control ATEl, which is consistent with the results 
obtained in HeLa and SK-N-MC clonal lines expressing 
long untranslated CAG repeats (Figure 5 and 
Supplementary Figure S2). The scale of alternative 
splicing differed from clone to clone. 

We next addressed the question of whether splicing 
misregulation occurs in cells derived from HD and 
SCA3 patients. Three HD ceU lines expressing 44, 46 
and 69 CAG repeats, two SCA3 lines with 69 and 74 
CAG repeats and three control fibroblast cell lines were 
used to examine the expression of the INSR, SERCAl 
and ATEl mRNA splicing isoforms. Significant changes 
in the exclusion of INSR exon 11 (52 ± 5% in HD and 
38 ± 4% in SCA3 versus 70 ± 2% in normal fibroblasts) 
and SERCAl exon 22 (41 ± 6% in HD and 49 ± 4% in 
SCA3 versus 87 ± 5% in normal fibroblasts) were 
observed only in cells bearing the longest repeat tracts 
(i.e. 69 and 74 CAG) (Figure 6A). To examine whether 
the observed splicing changes were triggered by the 
expanded CAG RNA repeats, we knocked down the 
mutant ATXN3 transcript in SCA3 cells and anticipated 
a reversion of the splicing abnormalities (Figure 6B). 
Indeed, the 3-fold reduction of mutant ATXN3 levels 
through siRNA treatment resulted in partial but signifi- 
cant restoration of exon 1 1 splicing in the INSR transcript 
(Figure 6C). 

Antisense CUG repeat transcripts are unlikely to trigger 
alternative splicing deregulation in cells expressing 
expanded CAG repeats 

To determine whether splicing misregulation, which we 
detected in various cell lines in the presence of CAG ex- 
pansions, was a result of antisense CUG repeat transcript 
co-expression, we designed and generated a strand-specific 
MLPA assay. In this multiplex assay, we used two probes 



specific for sense and two specific for antisense transcripts 
(Figure 7A). The MLPA was carried out for six ceU fines. 
Five of the cell lines were derived from SK-N-MC cells 
transfected with EGFP-N3-derived plasmids encoding the 
foUowing: (CAG)5, (CAG)74, (CUG)5, (CUG)74 and 
mutATXN3. Untransfected cells were used as a negative 
control and pEGFP-N3 DNA was used as a reference for 
the strand-specific MLPA assay. As shown in Figure 7B 
for (CAG)5-transfected cells, the signal from 
antisense-specific probes is very low when compared 
with the signal from sense-specific probes. Similar effects 
were observed for all other cell lines transfected with 
EGFP-N3. Average signals from antisense-specific 
probes account for 0.02 fraction of the signal from 
sense-specific probes (Figure 7C). These results indicate 
that the level of antisense transcript is very low (~2%) 
in comparison to sense transcript in all analyzed model 
cell fines, regardless of the repeat length, type and 
sequence. 

We also used in situ RNA hybridization with both the 
CAG and the CTG probes to test for the presence of 
nuclear RNA foci containing sense or antisense tran- 
scripts. For the FISH assay, we selected human HD and 
SCA3 fibroblasts expressing mutant HTT and ATXN3 
mRNAs containing 44 and 69 CAG repeats, respectively. 
As a positive control for the FISH, we chose DM 1 ceUs in 
which nuclear deposition of expanded CUG repeats 
(CUG'"'P) from mutant DMPK transcripts has been pre- 
viously shown (21,51). The FISH analysis using the CTG 
probe demonstrated that numerous nuclear RNA inclu- 
sions containing the CAG repeats were detected in the 
HD and SCA3 cell lines (Figure 7D, lower panels). We 
did not observe CAG repeat RNA in the nuclear inclu- 
sions in the FISH experiments performed after RNase A 
treatment, whereas, the inclusions remained after DNase 
treatment. The use of CAG probes in the FISH assay 
revealed a lack of CUG RNA inclusions in the same cell 
fines, which may indicate that the HTT and ATXN3 anti- 
sense transcripts spanning the repeat region were not 
present in amounts detectable by FISH (Figure 7D, 
upper panels). For comparison, we detected typical 
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Figure 4. Short exogenous CUG and CAG repeat RNAs trigger 
DMl-specific splicing abnormalities in HeLa cells. RT-PCR products 
of the alternative splicing of MBNLl. SERCAl and INSR genes after 
48-h treatment of HeLa cells with designated synthetic oligoribonu- 
cleotides delivered in 40 nM concentration. Bar graphs show the per- 
centage of the appropriate mRNA isoforms relative to the total amount 
of splice products. Splicing aberrations were observed for the CUG and 
CAG ORNs; (UAG)i7 sequence also triggers some splicing misbalance. 
No changes were detected for control repeats (UAA)|7, (UGG)i7, 
siLuc, or pre-miR-136. 



CUG repeat RNA nuclear foci in DM1 cells in the same 
FISH assay. 

MBNLl co-localizes with mutant repeats in DM1, HD 
and SCA3 cells and shows similar affinity to CAG and 
CUG repeat RNAs 

In DM1, the presence of CUG repeat RNA in the 
intranuclear foci is associated with sequestration of 
MBNLl protein, which triggers aberrant splicing of 
specific genes (17,20,22). To investigate whether similar 
correlations occur between CAG repeat RNA foci 
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Figure 5. Splicing changes in cells expressing mutated AtaxinS 
fragment with 69 glutamine residues. (A) Diagram of the 
pEGFP-mutATXN3 construct used to generate SK-N-MC cell lines 
expressing 69 CAG repeats in protein coding region. The stop codon 
of the EGFP protein was removed and a cDNA sequence of mutant 
human A TXN3 containing 69 CAG repeats was inserted in frame at the 
3'-terminus to induce expression of EGFP-mutant Ataxin 3 fusion 
protein containing 69 Q-residues. (B) RT-PCR products of the alter- 
native splicing of SERCAl, CLCNl and control ATEl gene. 
Quantification of the results is presented in the graphs as the percentage 
of splice products that include or exclude the indicated exons. The 
experiments repeated three times were carried out in three transgenic 
SK-N-MC lines (mutATXN3 1-3). *P = (0.05; 0.001), **/>< 0.001 
compared with the untreated cells (control). SK-N-MC cells with 
MBNLl siRNA knocked down were used as a positive control 
(siMBNLl). 



formation and the splicing misregulation in HD and 
SCA3 cells, we performed combined FISH/ 
immunostaining analysis to visualize localization of both 
CAG repeat RNA and MBNLl. As expected, the expres- 
sion of mutant DMPK mRNA in DM 1 fibroblast resulted 
in co-localization of punctate CUG repeat foci with 
MBNLl (20). The MBNLl splicing factor also 
co-localizes with mutant HTT and ATXN3 transcripts 
containing expanded CAG repeats presented in nuclear 
foci (Figure 8 A). MBNLl protein was present throughout 
the control fibroblasts lacking the CAG and CUG RNA 
expansions. This result indicates that the expression of 
either CUG or CAG expanded repeats may trigger seques- 
tration of the MBNLl protein. 
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Figure 6. SERCAl and INSR undergo alternative splicing changes in 
human HD and SCA3 fibroblasts. (A) RT-PCR analysis of endogenous 
SERCAl, INSR and ATEl mRNA isoforms in fibroblasts derived 
from HD (69 CAG repeats in HTT) and SCA3 patients (74 CAG 
repeats in ATXN3) and healthy individuals (Normal). For comparison, 
normal cells after siRNA-mediated down-regulation of MBNLl expres- 
sion were used (+siMBNLl). A modest, but clearly visible disturbance 
of the balance of the SERCAl and INSR splicing isoforms was 
detected in HD and SCA3 cell lines. MBNLl knock-down resulted in 
analogical but stronger changes between splicing isoforms. In contrast, 
ATEl mRNA expression remained unchanged. The percentages of 
exon 22 and exon 1 1 exclusion relative to the total transcripts 
indicate mean values ± SD of three independent experiments 
*P = (0.05; 0.001), **/'<0.001 compared with results obtained for 
normal fibroblasts). (B) The level of mutant ATXN3 transcript (74 
CAG repeats) in SCA3 fibroblasts treated with three siRNAs: 
control, ATXNS-ipedfic (siG16) and MBNLl-^•,td<c^c (siMBNLl). 
The relative expression level was determined by RT-PCR analysis of 
CAG repeat region of ATXN3 compared to GAPDH amplification 
product. (C) Treatment of SCA3 cells with siG16 resulted in partial 
reversion of INSR splicing misregulation, while MBNLl silencing led 
to significant increasing of mRNA isoform without exon 11. Graph 
shows average values ( ± SD) for three independent transfection experi- 
ments [*/> = (0.05; 0.001)]. 



Next, we determined the ability of MBNLl splicing 
factor to bind CAG and CUG repeat RNAs in vitro. 
Using a filter binding assay (41), we compared the 
binding affinity of recombinant MBNLl protein to the 



numerous synthetic oligoribonucleotides composed of dif- 
ferent lengths of triplet repeats. The results shown in 
Figure 8B demonstrate that MBNLl has similar affinity 
for CUG and CAG repeats with only a slight preference 
for the CUG repeats (^d*'^"°'-*' = 7.2 ± 1 nM versus 
^^(CAG)2o = 11.6 ± 1.8 nM). For shorter (CUG)? RNA, 
the calculated was shghtly lower 

(75:0"^"°''' = 4.3 ± 0.7 nM). The control triplet repeat 
RNAs, (AUG)i7, (UUA)i7 and (UGG)i7, did not bind 
MBNLl under our assay conditions. These results 
suggest that the observed co-localization of expanded 
CAG repeat RNAs with MBNLl in HD and SCA3 cells 
is caused by direct RNA/protein interaction in nuclear 
foci and by depletion of the functional splicing factor in 
the nucleoplasm that leads to splicing misregulation of 
MBNLl -dependent transcripts. 



DISCUSSION 

Recent research has suggested a potential role for mutant 
RNA-containing CAG repeat expansions in the pathogen- 
esis of poly-Q disorders (35,36). In this study, we per- 
formed comparative analysis of spHcing misregulation in 
cells expressing exogenous and endogenous CAG or CUG 
repeat RNA to determine whether the expanded CAG 
repeat transcripts recapitulate some of the toxic features 
of CUG repeat RNA. We have shown that the expression 
of transcripts containing long CAG repeats results in 
RNA foci formation, MBNLl co-localization and 
splicing aberrations similar to those detected for CUG 
repeat transcripts. We demonstrated abnormal alternative 
splicing using cellular model systems designed to express 
untranslated CAG and CUG repeats of various lengths. 
In those cells, we did not observe elevated expression of 
CUGBPl protein (Supplementary Figure S6). The splicing 
defects were also induced by expression of exogenous 
mutant ATXN3 transcripts containing 69 CAG repeats 
in the translated sequence, and they were observed in 
cell hnes derived from patients with SCA3 and HD. 
Given the similarity of the splicing defects caused by un- 
translated and translated CAG repeats, one could specu- 
late that the presence of expanded CAG repeat RNA itself 
is sufficient to induce some pathogenic features of poly-Q 
expansion diseases. 

Recently, involvement of antisense triplet repeat RNAs 
has been implicated in the pathogenesis and progression of 
SCA8, fragile X-associated tremor/ataxia syndrome 
(FXTAS) and Huntington's disease-hke 2 (HDL2) 
(52,53). In this study, we also considered the possibility 
that bidirectional transcription through the repeat region 
resulting in high relative levels of the antisense versus 
sense transcripts could have influence on splicing 
misregulation which we attributed to CAG sense RNA. 
The presence of antisense CUG-containing transcripts was 
tested using a sensitive MLPA assay, and its potential 
participation in RNA foci formation was analyzed by 
RNA FISH. With those assays, we determined that (i) 
the average level of antisense CUG repeat transcripts 
was very low and accounted for about 2% of the CAG 
repeat transcripts, (ii) no antisense CUG repeat RNA foci 
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Figure 7. Identification of sense and antisense repeat transcripts in cells expressing long CUG and CAG repeats. (A) Schematic representation of 
strand-specific MLPA assay design with probes located in the EGFP sequence close to the position of the changeable repeat tract. Signal from the 
probes hybridizing to plasmid DNA represents equal dosage from sense and antisense strands. The reverse transcription (RT) was performed with 
DNase treated total RNA. The signal from probe hybridizing to cDNA representing sense and antisense (a-sense) transcripts is proportional to the 
cellular level of these transcripts. (B) Representative MLPA electrophoregrams show results obtained for: cDNA from untransfected cells where no 
signals from sense and antisense probes are detected (top panel), cDNA from cells transfected with pEGFP-derived plasmid encoding (CAG)5 where 
very low, barely detectable signals from antisense-specific probes (Al and A2) are visible (middle panel) and pEGFP-N3 plasmid DNA, where 
comparable signals from both the sense- and the antisense-specific probes are shown (lower panel). The signal from sense (SI and S2) and antisense 
(Al and A2) probes are indicated. (C) Bar plots show fraction of signal from the antisense-specific probes normalized to average signal from the 
sense-specific probe obtained from five cDNA samples of SK-N-MC cells transfected with the pEGFP-N3 encoding: (CAG)5, (CAG)74, (CUG)5 and 
(CUG)74 and mutATXN3 (left-hand side) and from two samples of pEGFP-N3 plasmid DNA (right-hand side). (D) Representative images of 
RNA-FISH in cultured human fibroblasts using Cy3-labeled DNA/LNA probes (CAG)(,-CA (upper rows) and (CTG)6-CA (lower rows). HD cells 
expressing 44 CAG repeats and SCA3 cells expressing 69 CAG repeats from, respectively, the HTT and the ATNX3 genes show nuclear retention of 
sense transcripts harboring the CAG repeat expansions. Such CAG RNA inclusions were not detected in the nuclei of normal and DMI fibroblasts 
when FISH was carried out with the same CTG probe. The presence of CAG repeat RNA in the nuclear inclusions was confirmed by RNase A (no 
FISH signal) and by DNasel treatment (presence of FISH signal). The antisense transcripts through the repeat region in mutant HTT and ATXN3 
genes were not detected with CAG probe in the nuclear RNA inclusions of HD and SCA3 cells. In this assay, DMI cells were included as a positive 
control to detect ribonuclear CUG repeat inclusions. CAG and CUG nuclear RNA foci (red); DAPI nuclear stain (blue). 
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Figure 8. MBNLl protein binds CAG and CUG repeats in vitro and 
co-localizes with repeat region of mutant HTT and ATXN3 transcripts 
in nuclear inclusions. (A) Combined FISH/IF assay using Cy3-labeled 
CAG probe in DM1 cells and CTG probe in normal, HD and SCA3 
cells as well as rabbit polyclonal MBNLl primary antibody (A2764) 
detected with secondary Cy2-labeled anti-rabbit antibody. Merged 
images show the colocalization of the MBNLl protein with nuclear 
CAG repeat RNA inclusions in HD and SCA3 cells. Such 
colocalization is also demonstrated in DM1 cells with CUG RNA 
foci. In normal fibroblasts, MBNLl is detected throughout the cell. 
DAPI staining (blue), MBNLl (green), CAG and CUG nuclear RNA 
(red). (B) Results of filter binding assay to determine binding affinity of 
MBNLl protein to 5'-end labeled transcripts of (CUG),, (CUG)2o, 
(CAG)7, (CAG)2o, (AUG), 7, (UUA),, and (UGG),,. Affinity of 
MBNLl to CUG and CAG RNA is similar with only shght preference 
for the CUG repeats. 



formation was detected in cells expressing expanded CAG 
repeat RNA and (iii) distinct numerous CAG repeat RNA 
nuclear inclusions were visible in cells expressing mutant 
sense CAG transcripts. These results clearly indicate that 
the presence of minor but MLPA-detectable fractions of 
antisense transcripts did not localize within RNA foci. In 
DM1 cells, the presence of intranuclear foci mediates 
pathogenesis by affecting the accessibility of functional 



MBNLl splicing factor, and, therefore, lack of such inclu- 
sions in tested cell lines strongly implies that misregulated 
alternative splicing was the result of factors other than 
antisense CUG transcripts. 

Earher studies have shown that DM-specific aberrant 
splicing patterns were reproduced in cell cultures in 
which endogenous MBNLl was reduced by siRNAs (45) 
and in Mbnll^^^''^^^ mice that lack most MBNLl protein 
isoforms (54). Consistent with loss of the splicing factor, 
muscleblind knockout mice and transgenic DM1 HSA^"^ 
mice expressing long CUG RNA repeats (LR) from the 
human skeletal actin promoter (HSA) demonstrated a 
similar phenotype, which clearly implies involvement of 
MBNLl in DM pathogenesis (55). Our results reveal the 
occurrence of a DM1 -characteristic pattern of splicing 
misregulation in the presence of CAG repeat RNA, and 
we propose this effect to be MBNLl-dependent and 
CUGBPl-independent. This proposition is consistent 
with the lack of CUGBPl up-regulation and MBNLl 
co-localization with mutant transcripts within nuclear 
foci. Similar splicing defects triggered by CUG and 
CAG repeats could be explained by the results of our 
in vitro analysis of the abihty of MBNLl to bind to 
these repeats as well as by previous reports (18,41,56). 
These results show that CUG and CAG repeats differ 
only slightly in their binding affinity for MBNLl. It is 
important to note, however, that even smaU changes in 
the cellular level of MBNLl and toxic CUG repeat tran- 
scripts can create an imbalance in alternative splicing 
without changing the ability to form nuclear foci as 
observed in the DM1 HSA mouse model (51) (X. Lin 
and C. Thornton, personal communication). This finding 
could explain our observation of splicing misregulation in 
the absence of foci formation using high concentrations of 
short synthetic RNAs (CUG)7, (CAG)?, (CUG)2o and 
(CAG)2o in HeLa cells. Therefore, a high concentration 
of the ORNs compensated for their reduced length and 
triggered the same alternative splicing aberrations as 
mutant repeat RNA. A similar observation was described 
earher in a mouse model in which high expression of the 
DMPK transcript with short CUG repeats induced 
splicing abnormalities characteristic of DM1 (57). This 
unexpected effect of short repeat RNAs on splicing 
abnormalities should be taken into consideration when 
designing an antisense strategy against expanded CUG 
or CAG repeat transcripts. The chemistry of the oHgoniers 
could be an important factor in eliminating their undesir- 
able effect on splicing. Two different types of antisense 
ohgomers composed of CAG repeats, morphoHnos (58) 
and 2'-(9-metyl-phosphorotioates (59), were recently used 
to disturb the expanded CUG repeat/MBNLl interaction 
in vivo and yielded significantly different splicing correc- 
tion effects in the same mouse model of DM 1 . 

Evidence for the contribution of CAG repeat RNA to 
the pathogenesis of poly-Q disorders has been provided by 
two earlier reports (35,36). Ho et al. (36) found that CAG 
repeat RNA mimics the effects of CUG mutant repeats on 
foci formation and MBNLl co-localization. In their 
system, co-transfection of C0SM6 ceUs with plasmids ex- 
pressing long but interrupted CAG tracts and cTNT or 
INSR minigenes did not result in splicing alterations of the 
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two tested pre-mRNAs. The splicing changes were 
detected only for long CUG transcripts. We believe that 
major sources of discrepancy between our results and 
those of Ho et al. (36) were cells used for transfections 
and different responsivness of endogenes and minigenes 
to mis-splicing triggers. In addition, differences in expres- 
sion level between transgenes harboring different repeats 
could account for observed variations in alternative 
splicing abnormahties. 

The contribution of expanded CAG repeat RNA to 
poly-Q toxicity was previously reported by Li et al. (35). 
In their Drosophila model, SCA3 expression of untrans- 
lated RNAs containing CAG repeats of pathogenic length 
was deleterious, indicating that SCA3 neurodegeneration 
may be manifested in the absence of poly-Q protein. 
Altering pure CAG repeat sequences to an interrupted 
tract within the poly-Q encoding region significantly 
mitigated the toxicity of the untranslated repeat RNA. 
Due to the fact that flies expressing pathogenic SCA3 
poly-Q proteins encoded by interrupted repeat transgenes 
developed degenerative but less severe phenotypes than 
those of SCA3 poly-Q pure CAG, Li et al. (35) suggested 
that the RNA contributes to a shift of the toxicity curve 
rather than being solely responsible for toxicity. The 
results of up-regulation of 'musclebhnd' activity on 
poly-Q-triggered toxicity in SCA3 Drosophila were strik- 
ingly different from what has been so far found for CUG 
repeat mutation. Surprisingly, poly-Q-triggered degener- 
ation was enhanced after overexpression of either MblA 
or human MBNLl suggesting that the protein may act on 
the RNA to add to poly-Q toxicity. 

The splicing misregulation of the Clcnl and Mbnll 
genes was recently described in the mouse model of 
Kennedy's disease AR"^*^ in which \he AR gene expresses 
113 CAG repeats from the mouse Ar promoter (60). The 
authors argued that the mis-splicing was triggered by 
mutant protein. However, the possibility that the poly-Q 
protein may exert effects on RNA splicing through the 
disruption of the organization of nuclear bodies seems 
rather unhkely in our case. We do not observe a general 
deregulation of the alternative splicing process and only 
specific mRNAs are affected. Our study shows that the 
expression of mutant transcripts alone is sufficient to 
trigger the mis-splicing events and that splicing deregula- 
tion is CAG repeat-dependent. 

Lastly, as the formation of intranuclear inclusion bodies 
in HD ceUs has been suggested to be a beneficial response 
to the expression of toxic poly-Q (61), a similar protective 
effect might be attributed to nuclear deposition of mutant 
RNA. It is not known whether RNA foci formed by CAG 
repeats represent a fraction of transcripts designated for 
nuclear degradation or whether they become transiently 
blocked in the nucleus as soluble RNA-MBNLl 
complexes that eventually reach the cytoplasm becoming 
available for toxic protein synthesis. The results of this 
and previous studies indicate that dwelhng on toxic 
RNA gain-of-function mechanisms in mutant CAG- 
related disease might help explain some unknowns and 
uncertainties in our search for a cure of complex poly-Q 
disorders. 
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